MaturePred: Efficient Identification of MicroRNAs within Novel Plant Pre-miRNAs

نویسندگان

  • Ping Xuan
  • Maozu Guo
  • Yangchao Huang
  • Wenbin Li
  • Yufei Huang
چکیده

BACKGROUND MicroRNAs (miRNAs) are a set of short (19∼24 nt) non-coding RNAs that play significant roles as posttranscriptional regulators in animals and plants. The ab initio prediction methods show excellent performance for discovering new pre-miRNAs. While most of these methods can distinguish real pre-miRNAs from pseudo pre-miRNAs, few can predict the positions of miRNAs. Among the existing methods that can also predict the miRNA positions, most of them are designed for mammalian miRNAs, including human and mouse. Minority of methods can predict the positions of plant miRNAs. Accurate prediction of the miRNA positions remains a challenge, especially for plant miRNAs. This motivates us to develop MaturePred, a machine learning method based on support vector machine, to predict the positions of plant miRNAs for the new plant pre-miRNA candidates. METHODOLOGY/PRINCIPAL FINDINGS A miRNA:miRNA* duplex is regarded as a whole to capture the binding characteristics of miRNAs. We extract the position-specific features, the energy related features, the structure related features, and stability related features from real/pseudo miRNA:miRNA* duplexes. A set of informative features are selected to improve the prediction accuracy. Two-stage sample selection algorithm is proposed to combat the serious imbalance problem between real and pseudo miRNA:miRNA* duplexes. The prediction method, MaturePred, can accurately predict plant miRNAs and achieve higher prediction accuracy compared with the existing methods. Further, we trained a prediction model with animal data to predict animal miRNAs. The model also achieves higher prediction performance. It further confirms the efficiency of our miRNA prediction method. CONCLUSIONS The superior performance of the proposed prediction model can be attributed to the extracted features of plant miRNAs and miRNA*s, the selected training dataset, and the carefully selected features. The web service of MaturePred, the training datasets, the testing datasets, and the selected features are freely available at http://nclab.hit.edu.cn/maturepred/.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of microRNAs in corpus luteum of pregnancy in buffalo (Bubalus bubalis) by deep sequencing

This study was aimed to identify miRNAs of corpus luteum (CL) in buffaloes during pregnancy. For this study, CL (n=2) were collected from gravid uteri of buffalo and RNA was isolated. Following this, the purity and integrity of RNA was checked and used for deep sequencing using Illumina Hiseq 2500 platform. The reads’ quality was checked prior to in silico analyses viz. identification of conser...

متن کامل

miRLocator: Machine Learning-Based Prediction of Mature MicroRNAs within Plant Pre-miRNA Sequences

MicroRNAs (miRNAs) are a class of short, non-coding RNA that play regulatory roles in a wide variety of biological processes, such as plant growth and abiotic stress responses. Although several computational tools have been developed to identify primary miRNAs and precursor miRNAs (pre-miRNAs), very few provide the functionality of locating mature miRNAs within plant pre-miRNAs. This manuscript...

متن کامل

MatPred: Computational Identification of Mature MicroRNAs within Novel Pre-MicroRNAs

BACKGROUND MicroRNAs (miRNAs) are short noncoding RNAs integral for regulating gene expression at the posttranscriptional level. However, experimental methods often fall short in finding miRNAs expressed at low levels or in specific tissues. While several computational methods have been developed for predicting the localization of mature miRNAs within the precursor transcript, the prediction ac...

متن کامل

PlantMiRNAPred: efficient classification of real and pseudo plant pre-miRNAs

MOTIVATION MicroRNAs (miRNAs) are a set of short (21-24 nt) non-coding RNAs that play significant roles as post-transcriptional regulators in animals and plants. While some existing methods use comparative genomic approaches to identify plant precursor miRNAs (pre-miRNAs), others are based on the complementarity characteristics between miRNAs and their target mRNAs sequences. However, they can ...

متن کامل

Insights into role of microRNAs in cardiac development, cardiac diseases, and developing novel therapies

Objective(s): MicroRNAs (miRNAs) are a subfamily of small noncoding RNAs that play a variety of roles in regulating gene expression in nearly all organisms. They affect different biological pathways by post-transcriptionally regulating mRNAs. Aside from miRNAs’  role in maintaining cellular homeostasis, their perturbation is related to several pathologic states and dis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2011